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[57] ABSTRACT 

A cable television (CTV) system having an ad-insertion 
apparatus for automatically inserting commercial segments 
into program material under the control of cue tones trans- 
mitted by the program source. The system includes appara- 
tus for normalizing the audio signal levels of the program 
and commercial materials so that the audio portion of the 
output signal being transmitted to subscribers will have a 
relatively uniform loudness. The same concept may be 
applied to video signals. Additionally, signals coming from 
several channels may be normalized with respect to each 
other using the same technique. One aspect involves nor- 
malization of the audio level of the commercial, based on 
measured levels of the program audio preceding the adver- 
tisement. In other variations, the program audio level is 
adjusted to match a preset audio level of an advertisement 
In another aspect of the invention, the audio level adjustment 
is achieved by monitoring the deviation of an audio modu- 
lator. In general the technique comprises generating com- 
posite CTV output signals in each of a plurality of CTV 
channels by generating a series of program segments and 
cue tones indicating the borders of the program segments 
and a series of commercial segments in response to the cue 
tones. Each CTV channel output is formed by alternately 
linking program segments with commercial segments at the 
borders in response to the cue tones. The channel outputs are 
combined for simultaneous transmission to subscribers. The 
loudness of the segments in one of the CTV channels is 
monitored. Volume attenuators are adjusted in each of the 
channels as a function of the loudness in one of the channels 
such that the loudness of the audio in the channels is 
normalized. 

38 Claims, 12 Drawing Sheets 
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METHOD AND APPARATUS FOR 
NORMALIZING SIGNAL LEVELS IN A 
SIGNAL PROCESSING SYSTEM 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The invention relates to signal processing systems and 
methods and, particularly, to techniques for normalizing 
signal levels in a signal processing system. 

2. Description of the Prior Art 

Many electronic systems configure and format signals by 
linkin g together a series of signal segments obtained from a 
number of different sources. In such systems, it is usually 
important that the signal levels at the different sources be 
matched. Conventional television and radio transmission 
systems are notable examples. 

For example, subscriber television systems, such as cable 
television (CTV), normally deliver programs that are formed 
from a number of successive segments that originate at 
different sources. Many cable channel programmers set 
aside approximately four minutes in two blocks each hour 
for local advertising insertion. These advertisement blocks 
are sold by the local cable operator or by an advertising 
consortium of several cable systems. The cable operator 
automatically inserts the advertising in, for example, a 
satellite-delivered program coming from the programmer. 
The insertion is usually done locally under the control of cue 
tones transmitted by the programmer. At these specific cues, 
the cable operator switches different audio and/or video 
programming. Consequently, cable operators frequently 
encounter the problem of matching the audio and/or video 
levels between the different sources. This problem is par- 
ticularly acute in CTV systems where the system performs 
automatic switching with no human operator to adjust 
levels. 

In many prior art CTV systems, ad-insertion is handled by 
a combination of cue tone detectors, switching equipment 
and tape players which hold the advertising material. Upon 
receipt of the cue tones, a CTV insertion controller auto- 
matically turns on a tape player containing the advertise- 
ment Switching equipment then switches the system output 
from the video and audio signals received from the pro- 
gramming source to the output of the tape player. The tape 
player remains on for the duration of the advertising, after 
which the insertion controller causes the switching equip- 
ment to switch back to the video and audio channels of the 
programming source. When switched, these successive pro- 
gram and advertising segments usually feed to a radio- 
frequency (RF) modulator for delivery to the subscribers. 

Many subscriber television systems, such as CTV 
systems, are currently being converted to digital equipment 
In the future, video file-server systems will replace many of 
the conventional tape players. These new digital systems 
compress the advertising da t a, e.g., using Motion Picture 
Experts Group 2 (MPEG2) compression, store the com- 
pressed data as a digital file on a large disk drive (or several 
drives), and then, upon receipt of the cue tone, spool 
("play") the file off the drive to a decompressor. The video 
and accompanying audio data are decompressed back to 
standard video and audio, and switched into the video/audio 
feed of the RF modulator for delivery to the subscribers. 

One of the most critical problems confronting designers 
of CTV systems and other similar transmission systems, has 
been normalizing the audio and video levels between the 
programming and the advertising. It is generally known that 



12,018 

2 

many subscribers have complained for years that the audio 
sounds higher during commercials than during program- 
ming. Although the audio during commercials can also 
sound low compared to the program level, few people 

5 complain in that case. 

Consequently, those concerned with the development of 
radio, television, data, control and equivalent transmission 
systems have recognized the need for more effective signal- 
level normalization techniques. The present invention ful- 

io fills this need. 

SUMMARY OF THE INVENTION 

Therefore, it is an object of the invention to provide an 
improved signal-level normalization technique for use in 
15 signal transmissions such as audio, video and data transmis- 
sions. 

It is another object of the invention to provide a signal- 
level normalization technique particularly suitable for use in 
digital networks. 

20 A further object of the invention is the provision of 
systems and methods of normalizing the signal level in a first 
signal block, such as the audio and/or video levels in a 
television program or a commercial, based on comparable 
measured levels of a second signal block, such as a prior 

25 program or a preceding commercial. 

Still another object of the invention is the provision of 
systems and methods of normalizing the signal level in a 
signal block based on previously measured levels of the 

30 signal block. 

Yet a further object of the present invention is the provi- 
sion of a signal insertion technique capable of performing 
quality checks, such as verifying that the proper commercial 
was inserted into a predetermined location in the advertising 

35 block of a program 

According to the invention, a signal processing system 
having a normalized output signal comprises a first signal 
source and a second signal source. A signal combiner 
connects to the first and second signal sources for forming 

40 an output signal by linking signal segments derived from the 
first and second signal sources into a series of the signal 
segments. A level processor connects to the signal combiner 
for determining a level of intensity of the output signal. A 
level adjuster connects to at least one of the signal sources 

45 and responds to the level processor for adjusting a level of 
intensity of the signal segments at least one of the signal 
sources such that the level of intensity of the output signal 
is normalized. 

More specifically, the invention provides a signal trans- 

50 mission system for producing a composite output signal 
formed by linking signals from a plurality of signal sources 
comprising a first signal source for generating a series of first 
segments and cue tones indicating the borders of the first 
signal segments, and a second signal source connected to the 

55 first signal source. Responsive to the cue tones, the second 
signal source generates a series of second signal segments. 
The first and second signal segments include audio signals. 
A signal combiner is connected to the first and second signal 
sources for forming the composite output signal by alter- 

60 nately linking the first signal segments with the second 
signal segments. A level processor connects to the signal 
combiner for determining the loudness of the audio portion 
of the composite output signal. A level adjuster connects to 
at least one of the signal sources and responds to the level 

65 processor for adjusting the loudness of signal segments from 
the signal sources such that the volume of the output signal 
is normalized. 
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Still another aspect of the invention involves a signal of FIG. 13 with a conventional A-weighting curve which 

transmission method for transmitting an output signal by closely resembles me frequency response of a typical human 

generating composite output signals in each of a plurality of ear. 

signal channels. The composite output signals in each chan- DETAILED DESCRIPTION OF THE 

nel is formed by generating a series of first signal segments 5 PREFERRED EMBODIMENTS 

and cue tones indicating the borders of the first signal _ _ . ^ . , t « „ If . 0 _ ■ 

« - r j • * & . Referring now to the drawings, FIG. 1 shows a general 

segments, and generating a series of second signal segments t , , , e , Zis! T * • * u a * a *u ♦ 

in response to toe cue tones. Each channel ou*ut is formed Week diagram ofCTV system 20 . » is to be understood^ 

by alternately Unking the first signal segments with the the parucular CTV systems described herein are exemp ary 

secondsignalsegmenteatthebordeninres^onsetothecues 10 and that flie invention has application m other equivalent 

tones. niechanneloutputsfromeachchannelarecombined aud,0 ^ eo - *** "5* equivalent transmission sys- 

forstoultaneous^missiontousers-Themethodincludes tems ^ CTV system 20 includes earth station receiver 21 

mestepofdeteiminmgalevelofmtensityofsignalsinafirst P™ des F T^^Hi.™^^ 

channel output and adjusting a level of intensity of each of audl0 >. at lts outout u , nes 2Z *«* s ^ on receiver » 

the channel outputs as a function of the intensity in the first is com Pf es «>nventK>nal receiver and decoder equipment. A 

channel outputTuch that the levels of intensity of the channel T^J***^ 2 • ?°T^ T T^KS « 

outputs are normalized. * e Natl0nal ™ e ™ 10n S ? S T (^Qformat), 

T plus one to three channels of audio. The audio channels are 

These and other objects features and aspects of die mQst oommoill left . md righ t^hannel stereo and possibly 

invention wiU be more clearly understood and better a ^ monaur > channd fa ft second audio (SAp)? 

described if the following detailed description is read in 20 ^ ^ e Many cable operators contem- 

conjunction with the appended drawings wherein: ^ replacmg mese analog systems with more modern 

BRIEF DESCRIPTION OF THE DRAWINGS digital CTV systems that output digital video and/or audio 

t^t^ + • j 1 1 j. - ui * 1 • • signals from receiver 21. 

FIG. 1 is a system block diagram of a cable television ° T A A A . u t . A 
« , . , ' j **u fj-xln addition to the video and audio channels, receiver 21 

head-end system constructed in accordance with a preferred 25 ♦ ^ 

, . ir • transmits cue tones on line 23 to ad-insertion system 24 

embodiment of the invention. which ^ ±& ^ ^ ^ genefate signals f()r 

FIG. 2 is a graph of loudness of an audio signal on a controlling me insertion of advertising into the program 

logarithmic scale vs. time, which is useful in understanding materia!, most systems, the cue tones are a series of 

the preferred embodiment of FIG. 1. ^ dual-tone multiple-frequency (DTMF) tones which identify 

FIG. 3 is a detailed block diagram of a cable television the programmer and the insertion times in the ad-insertion 

head-end system constructed in accordance with the pre- process. Typically, different cues are transmitted for pre-roll 

ferred embodiment of FIG. 1. (an advance in time to allow a tape machine to get up to 

FIG. 4 is a graph, similar to that of FIG. 2, showing speed), transfer-to-ad (a time at the beginning of the adver- 

loudness of an audio signal on a logarithmic scale vs. time, 35 tisement block) and return (the conclusion of the advertise- 

which is useful in understanding the preferred embodiment ment block). 

of FIGS. 1 and 3. Timin g signals, which ad-insertion system 24 outputs 

FIG. 5A is a flow chart illustrating process steps per- onto lines 34, operate transmission switch 25. Ad-insertion 

formed by the preferred embodiment of FIGS. 1 and 3. system 24 generates these timing signals in response to the 

FIG. 5B is a flow chart, similar to that of FIG. 5A, 40 cue tones received on line 23. When operated, switch 25 

illustrating alternate process steps performed by the pre- routes either the program audio and video, which appear on 

ferred embodiment of FIGS. 1 and 3. lines 22, or the advertising video and audio, which appear on 

FIG. 6 is a detailed block diagram, similar to that of FIG. ^ to modulator 30 via lines 33. Modulator 30 modu- 

3, of an alternate embodiment of the invention. lates me and onto an carrier mat u transimts 

FIG. 7 is a graph, similar to that of FIG. 4, showing 45 to ^bscribers along with a^ 

loudness of an ££> signal on a logarithmic scale vs. time Modulate » wm oft ™ mdl f ?5 

wmch is useful mundefs^^ receives ^d processes the signals, such as scramblers and 

™- « . ^ , _ .„ _ f! . j stereo encoders or the like. 

y 50 23, ad-inseruon system 24 automatically inserts commer- 

FIG. 9 is a system block diagram, similar to that of FIG. cials int0 advertising blocks by spooling advertising material 

1, of a cable television system constructed in accordance onto Unes 2? an(L hence modulator 30 via switch 25 and 

with another alternate embodiment of the invention. ^ 33 To puMdc normalization between the audio and/or . 

FIG. 10 is a detailed block diagram, similar to that of FIG. y^o i eve i s 0 f ^ commercials with that of the program, 

6, of the alternate embodiment of the invention shown in 55 level proce ssor circuit 26 monitors the signals being 

FKj. 9. switched onto lines 33. Level processor circuit 26 adjusts, 

FIG. 11 is a detailed block diagram of a commercial v i a lines 3^ the appropriate signal levels of the advertising 

verification system in accordance with the present invention. material being outputted by ad-insertion system 24. 

FIG. 12 is a flow chart illustrating process steps per- as described above, a primary problem addressed by this 

formed by the system of FIG. 11. 60 invention involves correcting a possible mismatch between 

FIG. 13 is a detailed circuit schematic with parameter the audio level coming from receiver 21 and that coming 

values for the various elements thereof, representing a from ad-insertion system 24. Each apparatus represents a 

particular implementation of a level detector that forms a source of audio that is prepared at a different location by 

part of the various preferred embodiments of the present different people using different equipment While efforts 

invention. 65 have been made in the past to normalize the audio levels 

FIG. 14 is a graph showing a plot of response in decibels from these different sources, limited success has been 

(dB) vs. frequency for comparing test results for the circuit achieved. 
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Topically, a level adjustment device resides within audio switches 52, 53 and 54, being suitably switched by 

ad-insertion system 24 to allow operating personnel to signals on lines 34. One input side of video switch 51 

manually or electronically correct for gain variations in the connects to program video source 21V. The other input side 

equipment. These level adjustment devices are often a of video switch 51 connects to advertisement video source 

source of possible errors due to frequent miss-adjustments. 5 24V. The output side of video switch 51 connects to video 

Still further, the gain of various pieces of system equipment modulator 30V. 

will often shift with time and temperature. Q ne mpu t side of each of audio switches 52-54 connects 

Yet another problem in the CTV industry is that the level to respective left right and SAP channels of program audio 

coming from receiver 21 may be inconsistent from one source 21A. The other input sides of audio switches 52-54 

channel to another. While most receivers and decoders used 10 connect to the respective left, right and SAP channels of 

in the industry have fixed output levels for a fixed input at advertisement audio source 24A via respective attenuators 

the uplink, different programmers operate uplinks with dif- 55, 56 and 57 which function as audio level controls. The 

fering input levels. This is partially because different types output sides of audio switches 52-54 connect to stereo 

of program material require different amounts of headroom, encoder modulator 30A. 

and different uplink engineers have different philosophies 15 The output sides of audio switches 52 and 53 also connect 

about the amount of headroom to allow. Because of this, to adder 39 of level processor circuit 26 via lines 33. 

level adjustments in ad-insertion system 24 have become Because the preferred manner of evaluating the level of a 

necessary. stereo signal is to evaluate the sum of its left and right 

FIG. 2, which plots loudness vs. time, illustrates the channels, adder 39 sums these stereo signals. Level detector 

problem graphically. This graph shows the relative audio 20 40 receives the summed output of adder 39 and inputs the 

levels of four consecutive segments consisting of two pro- inverting (-) side of comparison amplifier 32. Level detector 

gram segments PI and P2, and an advertising block with two 40 receives the audio signals at its input, and outputs a 

commercials CI and C2 located therebetween. Program voltage corresponding to the loudness. The setting of switch 

segments PI and P2 and commercials CI and C2 have their 25 will determine which audio signals adder 39 sums. With 

audio levels varying over time. The first program segment 25 respect to the graph of FIG. 2, the output of level detector 40 

PI extends between times tl and t3. An advertising block, will be a voltage that varies, preferably as a decibel (dB) 

starting with commercial CI, follows program segment PI. function, in accordance with the four curves representing the 

The graph shows commercial CI having a higher loudness loudness of program segments PI and P2 and commercials 

than did the preceding program segment PI. Next in the CI and C2. 

advertising block is second commercial C2. Because it was 30 Audio reference level circuit 41, represented by a Zener- 

recorded differently than was first commercial CI, the graph diode voltage source, provides a fixed reference level LI at 

shows it playing louder than commercial CI. The successive the (+) input to comparison amplifier 32. Thus, comparison 

commercials CI and C2 extend between times t3 and t4. amplifier 32 outputs the difference between the actual vol- 

Finally, second program segment P2 follows second com- U me level of the audio source, as detected by level detector 

mercial C2, and plays at the lower loudness level of program 40, and the reference volume level LI which has been preset 

segment PI. in reference level circuit 41. Analog-to-digital (A/D) con- 

The variations in loudness illustrated in FIG. 2 usually verter 44 samples the output of comparison amplifier 32. 

produce me irritating phenomenon sometimes called "blast- Consequently, A/D converter 44 outputs these sampled 

ing" out the commercial. As mentioned above, broadcasters ^ differences to logic 42 as a series of digital words propor- 

and cable operators have been accused of doing this inten- tional to a dB audio level. Logic 42 communicates with a 

tionally to gain listener attention for the commercials. memory 43. As will be described below in detail, logic 42, 

However, usually the source of the problem is more mun- which monitors the output of A/D converter 44, adjusts 

dane: either the commercial automatically plays and no one attenuators 55-57 via attenuator control lines 31 in accor- 

is available to correct the audio level, or metering problems 45 dance with error values calculated by logic 42. To perform 

make an objective comparison of the levels difficult. its level adjusting functions, logic 42 derives control and 

FIG. 2 also shows the transmission times of the cue tones. timing signals from insertion controller 46, which is a 

The pre-roll cue tone arrives at time t2, which is usually conventional part of ad-insertion system 24. 

about five to eight seconds before the commercial starts at In response to the cue tones transmitted from program 

time t3, to allow tape equipment to get up to speed before 50 video source 21V on line 23, insertion controller 46 operates 

being played at time t3. A transfer cue tone arrives at time transmission switch 25 and sends control signals to logic 42, 

t3 to insert first and second commercials CI and C2 into the video source 24V and audio source 24A. Further, insertion 

advertising block. Finally, a return cue tone arrives at the end controller 46 secures appropriate synchronization (sync) 

of the commercial break, time t4, to instruct the system to pulses and timing signals from sync separator and timing 

return to the program material supplied from receiver 21. 55 circuit 47, which is also a part of conventional ad-insertion 

Referring now to FIGS. 1-4, a normalization technique in system 24. Advertisement video source 24V uses the output 

which the advertisement audio level is adjusted to match the of circuit 47 to synchronize its video with program video 

level of the program audio will be described in detail. FIG. source 21V. Additionally, insertion controller 46 acquires 

3 depicts receiver 21 as having two channels, namely appropriate timing signals from timing circuit 47 so that it 

program video source 21V and program audio source 21A. $o can initiate spooling of the commercial when the appropriate 

Modulator 30 includes video modulator 30V and stereo pre-roll cue tone on line 23 is received, for example, at time 

encoder modulator 30A. FIG. 3 shows ad-insertion system t2 of FIG. 2. Still further, insertion controller 46 operates 

24 of FIG. 1 as having audio and video channels designated transmission switch 25, via switch control lines 34, in 

as advertisement video source 24V and advertisement audio accordance with the appropriate cue tones transmitted on 

source 24A. 65 ^ ne 23- 

FIG. 3 schematically depicts switch 25 as comprising a It is noted that the SAP channel, if supplied, is controlled 

plurality of ganged switches, namely video switch 51 and separately, based on its own characteristics. However, in the 
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interest of clarity, this description and the related drawings In calculate STEP 76, logic 42 calculates audio level 

omit the details of the SAP controls. However, it will be errors En based on the current value of the target level and 

readily understood by those skilled in these arts that the SAP the output sample at the start of a commercial. For example, 

channel, and any other audio or data channels, may be immediately after insertion controller 46 transmits transfer 

treated in a similar manner as are the left and/or right stereo 5 trigger, i.e., at time t3+, logic 42 calculates error El as the 

channels described herein. difference between the loudness level at the start of com- 

The operation of CTV system 20 will now be described mercial CI and the target level. Logic 42 next performs 

with reference to the circuit of FIG. 3, the sample curves adjust STEP 77 by applying the current audio level error El, 

depicted in FIG. 4 and the FIG. 5A flow chart As seen in via ft nes $i t0 ad j ust me p reset attenuators 55 and 56 in a 

FIG. 4, the audio level over any length of time is not 10 dkection mat wm cause me audio level of commercial CI to 

constant; typical program audio may range from a whisper moye down to ^ taf t leyel 

to the loud screech of an airplane taking off. Thus, it is reference level T 1 renresents the exDected 

necessary to be precise as to what is meant by the "level" of , " P ractl <f > reference level LI represents me expectea 

J ^ \. Aj . . J - . . . level for all advertisements and is normally obtained by 

a program. For the present description, one preferred strat- j* T T „ • , I • 

. A . i i . ir j £ j. recording (compressine) all commercials at a nominally 

egy is to match the last few seconds of the program audio * \ 7" A , ' 

1 w - 1.* j r *T i: 15 identical audio level equal to level LI. FIG. 4 shows the 

level (right end of program segment PI) with the first few * ^ ?\' * . . , c1 . . 

\ V fU Q c n nn„;Z n nlft on A ~f average difference D and the audio level error El to be 

seconds of the following commercial (left end or commer- , * . „ , . ... _ ^ AU ^ . . 

* i T * a<-> 2 a \+: u « substantially equal, which illustrates the typical situation 

cial CI). Logic 42 performs this function by keeping a ' H . * ' r . , 

. , . rttl .„ . * . where commercial CI was actually recorded at the nominal 

running log in memory 43 of "level" measurements based on ^ 1T 7^ + u *u u a ttt^ a ,u^„ 

i * ♦ • i or expected level LI. On the other hand, FIG. 4 also shows 

an average of the last portion of the program material. At the „ . , ™ . . « - . u . u 

time tl? program plays, this average becomes the "target 20 -C2 as having been recorded slighdy higher than 

lever to which me commercial audio level is matched. me noimnal OT «* cctod level 

More specifically, upon start up of CTV system 20, logic ™ us > * me sequence of pro-am segment PI just 

42 proceeds to monitor the output of A/D converter 44 in P^ 10 commercial time transmits louder than the expected 

monitor STEP 70 (see FIG. 5A) and continues to do so until 25 or nominal level the level of commercial CI will be 

the system is turned off. During the playing of program increased by adjusting attenuators 55 and 56 On the other 

segment PI, transmission switch 25 is in its normally up hand > * ^ audl ° level of program segment PI is lower than 

position shown in FIG. 3. As such, adder 39 will sum the left ^ expected level LI as depicted in FIG 4 logic 42 adjusts 

andright channels of program audio source 21A for program attenuators 55 and 56 to reduce the level of commercial CI 

segment PL Consequently, the output of comparison ampli- 30 down to the target level. 

fier 32 will vary with the dB difference between program Ideally this would be the end of the process. However, as 

segment PI and fixed reference level LI. a F acti cal matter, it is not reasonable to expect that all 

Upon receiving the pre-roll cue tone at time t2, insertion commercials will be recorded at the "right" leveL The 

controller 46 transmits a pre-roll trigger to logic 42 in trigger commercials will be recorded at different times by different 

STEP 71, thereby causing it to store in memory 43 a series 35 individuals using different indicating instruments, usmg 

of output values from A/D converter 44 as indicated by store different audio processing systems, and applying different 

STEP 71. Logic 42 next receives, in trigger STEP 73, a ideas of how to interpret the audio level. Thus, the level of 

transfer trigger from controller 46 at time t3. At this point, ^ commercials will usually not be the same This is 

logic 42 uses the series of previously stored values, in illustrated in FIG. 4 by the difference between the levels of 

calculate STEP 74, to calculate and store the average output 40 commercials CI and CI. 

of A/D converter 44 between times t2 and t3. This average Logic 42 continues to monitor the output of controller 46, 

represents the average difference D between program seg- waiting for the next trigger signal. In the absence of a return 

ment PI and reference level LI for the period t2-t3 (see the cue tone, insertion controller 46 signals advertisement video 

FIG. 4 graph). Using this average difference D and reference source 24V and audio source 24A to start playing commer- 

level LI, in calculate and store STEP 75, logic 42 calculates 45 cial C2 at the end of commercial CI. At this time, controller 

a target level and stores that value in memory 43. 46 transmits a next-commercial trigge r to logic 42 which 

It is noted that many conventional ad-insertion equipment exits the NO path of decision STEP 78 and proceeds to 
know within a minute or two when a commercial is to be trigger STEP 79. In response to receiving the next- 
inserted, i.e., when to expect to receive a pre-roll cue tone. commercial trigger, logic 42 increments index n, m index 
Because of this, the program segments may be monitored 50 STEP M » and ^ process returns to calculate STEP 76. In 
and averaged over a time period other than period t2-t3, calculate STEP 76, logic 42 calculates the new audio level 
which usually lasts only five to eight seconds. For example, error E2, which represents the small additional amount that 
the average difference D may be deterrnined over a one or attenuators 55 and 56 must be adjusted, in adjust STEP 77, 
two minute period just prior to the advertisement block. to move the level of commercial C2 down to the target level. 

As discussed above, at time t2 insertion controller 46 55 When controller 46 receives a return cue from program 

triggers advertisement video source 24V and audio source video source 21V at time t4, it switches transmission switch 

24A to initiate the advertising process, i.e., to turn on the 25 back to the normally up position of FIG. 3, thereby 

tape player to get it up to speed or, in the case of a digital disconnecting advertisement video source 24V, and audio 

system, to retrieve digital data from the appropriate file source 24A and reconnecting program video source 21V and 

servers. Further upon receiving the transfer cue tone at time 60 audio source 21A to modulator 30. Controller 46 also 

6, insertion controller 46 switches transmission switch 25, transmits return trigger to logic 42. In response, logic 42 

via lines 34, to the down position as viewed in FIG. 3. At this exits decision STEP 78 on the YES path and returns the 

point, the advertisement video and audio channels are process back to monitor STEP 70. 

switched onto output lines 33 of switch 25 for transmission FIG. 5B, which is similar to FIG. 5A, depicts a modified 

to the subscribers. As such, adder 39 now monitors and adds 65 normalization process in which the audio level offset of each 

the left and right channels of advertisement audio source commercial is stored on its first playing so that upon 

24A. subsequent playing, the volume correction of that commer- 
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rial may be set to an approximately correct value without the a channel is shared during different times. During the day the 

need to play, monitor and measure the commercial's level. channel might be used for educational programming and 

In performing the FIG. 5B process on the program- during the evening the channel might be used for pay 

commercial sequence of FIG. 4, logic 42 performs STEPS programming. This necessitates the switching of inputs to 

70-75 to obtain the target level. At this point, logic 42 5 the receiver channel. If the audio levels from the two 

determines, in decision STEP 69, if the commercial about to different sources are different, which is the rule rather than 

be played, e.g. commercial CI, has been previously played the exception, CTV system 20\ an alternate embodiment 

To do this, logic 42 looks up table entries in memory 43 that normalizes these differences. 

have been previously stored, listing commercial identifica- FIG. 6 illustrates CTV system 20' which monitors the 

tion (ID) data, corresponding audio level errors Ep and its 1Q level of a program segment, e.g. program segment PI, and 

related target level. Controller 46 provides logic 42 with the adjusts its level to match the nominal commercial reference 

appropriate ID data. If this is the first playing of commercial level LI. Additionally, the commercials have their levels 

CI, the process exits STEP 69 along the NO path to calculate corrected, if necessary, as they begin to play. The advantage 

error El, in calculate STEP76, and proceeds to adjust STEP of this embodiment is that, if a program level changes as a 

77 to adjust attenuators 55 and 56. Next, logic 42 stores, in 15 result of uplink errors or system gain changes, the level 

store STEP 68, appropriate table entries in memory 43, correction subsystem automatically corrects the match for 

namely, the ID data for commercial CI, the corresponding these errors and/or changes. 

error Ep equal to the current error El and its related target addition to the elements that make up CTV system 20, 

level. If logic 42 receives a return trigger in decision STEP cry system 20* has three attenuators 65-67 in the program 

78, it exits the YES path and returns to monitor STEP 70 via 2Q audio path. More specifically, CTV system 20' shows attenu- 
index STEP 86 which increments index n. However, if logic a tors 65, 66 and 67 connected in the respective left, right and 
42 should receive a next-commercial trigger in trigger STEP SAP channels of program audio source 21A. Attenuator 

79, the process returns to decision STEP 69, via index STEP control lines 38 connect logic 42 to the control terminals of 
80 which increments index n, and the process repeats for program attenuators 65, 66 and 67. By adding program 
commercial C2. 25 attenuators 65-67, CTV system 20' functions to normalize 

However, if the commercial to be played has played errors in the program audio levels. As such, CTV system 20' 

previously, the process exits the YES path from decision automatically normalizes unwanted variations in program 

STEP 69. Logic 42 uses the commercial's ID data to levels that may be caused by, for example, inconsistent 

accesses the appropriate table entry in memory 43. In adjustments made by those who operate CTV head-ends, or 

calculate STEP 63, logic 42 calculates error Er using the 30 audio gain variations in the head-end that change with time 

current target level, error Ep, and its related target level. or temperature. 

Logic 42, in adjust STEP 64, next adjusts attenuators 55 and The audio level norrnalizing technique of CTV system 20* 

56 using audio level error Er. The process then moves to parallels that of CTV system 20, except that system 20* 

decision STEP 78 and proceeds further in the manner contains enough controls to allow the audio of program 

described above. 35 segments PI and P2, and the audio of commercials CI and 

Pre-correcting the loudness setting for the commercials, C2 to be set to reference level LI. This process will now be 

as just described with respect to FIG. 5B, requires that tables described with reference to FIG. 6, the curves in FIG. 7 and 

be constructed for all commercials in the system. Since the FIG. 8 flow chart. At the start, monitor STEP 90 initiates 

insertion controller 46 usually provides all commercials with the monitoring of the difference values outputted by A/D 

ID data uniquely associated with that commercial, logic 42 40 converter 44. In define STEP 91, logic 42 sets the stored 

can readily organize the table using that ID data. In the value of the target level to be equal to reference level LI, 

preferred embodiment, the table is duplicated for each which FIG. 7 shows to be within the range of commercial 

channel on which the commercial airs. This will accommo- CI. Upon receiving pre-roll trigger, in trigger STEP 81, 

date any differences in the gain of comparison amplifier 32 logic 42 performs store STEP 82, trigger STEP 83 and 

or other errors from channel to channel. 45 calculate STEP 84 to find the average difference D at the 

Many other modifications and variations are possible in output of A/D converter 44 over time period t2-t3 (or other 

the light of the above teachings. For example, in order to time period as described above). Logic 42 uses this average 

allow the above-described level setting activities to proceed difference D, which represents the difference between the 

without undue computational stress being required of logic target level (equal to reference level LI) and the audio level 

42, level detector 40 is preferably configured to output the 50 of the last portion of program segment PI, as an audio level 

measured level differences in decibels (dB), a well-known error signal for adjusting program attenuators 65 and 66. 

operation which converts ratio (division) operations into FIG. 7 shows the original position of program segment P2' 

addition. Those skilled in these arts can readily design the being shifted to the new position of program segment P2 as 

output voltage of level detector 40 to be proportional to the a result of these adjustments. 

dB level of the audio differences. FIG. 13, to be described 55 After adjusting program attenuators 65 and 66, in adjust 

below in detail, includes a specific implementation of level STEP 87, logic 42 exits the NO path of decision STEP 88 

detector 40 with a dB output Attenuators 55-57, therefore, and proceeds to set STEP 89. At this point, logic 42 stores 

attenuate the audio signals in a conventional dB relationship in memory 43 the output of A/D converter 44 as audio level 

with respect to errors En. Although the conversion to dB error En, which at this point in the present example equals 

representation may be performed intrinsically in level detec- 60 error El. Logic 42 adjusts, in adjust STEP 92, commercial 

tor 40, as depicted herein, it is possible and often preferable attenuators 55 and 56 using audio level error El. Because it 

to take the functions of audio reference level circuit 41 and was assumed that commercial CI was recorded substantially 

comparison amplifier 32 into the digital domain by digitiz- at the nominal advertisement recording reference level LI, 

ing the output of level detector 40. audio level error El equals zero, making attenuator adjust- 

A source of error in the audio level output to the sub- 65 ments at this point unnecessary. In the absence of receiving 

scribers occurs when the input sources at receiver 21 change a return trigger, in trigger STEP 93, logic 42 follows the NO 

from time to time. Such will be the case, for example, when path to trigger STEP 95 via index STEP 94 where index n 
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is incremented. Upon receiving the next-commercial trigger sync tip amplitude. It is known to those skilled in the art how 

from controller 46, in trigger STEP 95, logic 42 stores in to measure sync amplitude. In addition, if a second set of 

memory 43 audio level error E(n+1) as being equal to the level control attenuators are placed in the program video 

current output of A/D converter 44. At this point in the source output, such as in lines 22 of FIGS. 3 and 6, they 

present example, logic 42 stores audio level error E2 as the 5 could be used to correct for incorrect video level coming 

difference between the audio level at the start of commercial from that program video source. 

C2 and the target level, Le., reference level LI. As such, FIGS. 9 and 10 show yet another embodiment. Here CTV 

adjust STEP 92 performs an adjustment to advertisement system 20" corrects audio-level differences that may be 

attenuators 55 and 56 so that commercial C2 will play at or introduced by audio-level adjustments contained in conven- 

near reference level LI. This process is repeated for sue- tional modulator equipment A control device on a conven- 

ceeding commercials. tional modulator, e.g., modulator 30, is frequently labeled 

When logic 42 receives a return trigger from controller "DEVIATION " because it controls the deviation of the 

46, at time t4 in decision STEP 93, it returns the process to sound carrier: the greater the audio level the greater the 

trigger STEP 81 to await reception of the next pre-roll deviation. If the operators set the modulator deviations 

trigger at, for example, time t5. This sequence takes place 15 inconsistently from one channel to the next, the associated 

via the YES path of decision STEP 93. When controller 46 audio levels will change when subscribers are tuning chan- 

transmits the next pre-roll trigger at time t5, logic 42 nels. Thus, controlling the deviation (sound level) between 

proceeds through STEPS 81-84, thereby obtaining a new channels of a multi-channel CTV system 20", as depicted in 

average difference D for the final portion of program seg- FIGS. 9 and 10, creates further normalization problems, 

ment P2 over time period t5-t6. FIG. 7 shows the new 20 The normalization technique of CTV system 201" paral- 

average difference D, over period t5-t6, to be significantly lels that of CTV system 20' of FIG. 6, with a notable 

smaller than the difference D* that would have occurred exception that level detector 40 derives its audio input 

absent the prior adjustments made to program attenuators 65 signals form different points. CTV system 20' derives its 

and 66 in adjust STEP 87 at time t3. audio level information from the individual subscriber 

The level adjusting procedures so far described measure 25 channels, viz., lines 33. On the other hand, CTV system 20" 
the average level of the program segment over a short time derives its audio level information from a point common to 
period at the end of the segment, e.g., between the pre-roll all subscriber channels. More specifically, while CTV sys- 
and transfer cues. In the FIG. 7 scenario, this procedure tern 20* uses the left and right audio channels outputted by 
results in program segment PI running at a level much lower receiver 21 and ad-insertion system 24 via adder 39, system 
than target level LI. It is also noted that in the embodiments 30 20" derives the audio from the main subscriber lines 101. 
so far described, logic 42 adjusts the attenuators, e.g., Level processor circuit 26' of system 20", employs tune- 
attenuators 55-57 and/or 65-67, within a relatively short able demodulator 60 to selectably monitor the audio outputs 
time period that would normally go undetected by the of the various subscriber channels. The audio input of 
viewer. For example, after the average level of program demodulator 60 connects to the main subscriber lines 101 
segment PI in FIG. 7 is detected over time period t2-t3, the 35 via directional coupler 61. When monitoring a particular 
level of commercial CI is adjusted (if necessary) within a channel, demodulator 60 tunes to the output of the appro- 
short period that should go undetected by the listening or priate channel modulator, e.g., modulator 30A or an equiva- 
viewing audience. However, these functions can be modified lent audio modulator (not shown) in one of the other 
in some situations by measuring the average program levels channels 100. A preferable implementation of demodulator 
over one or more other time periods as the program segment 40 60 includes a conventional agile television demodulator with 
runs while making small periodic attenuator adjustments to a calibrated audio output and a tuner suitable for control by 
gradually bring the program or commercial level into line controller 46. In the television system used in North America 
with the target level. Using this modified procedure with the (NTSC), the audio on main subscriber lines 101 deviates the 
FIG. 7 scenario, the first program segment PI can be audio carrier by 25 kQohertz (KHz). In this regard, demodu- 
gradually brought into line with target level LI well before 45 lator 60 is preferably calibrated such that its output voltage 
receiving the pre-roll cue at time t3. The application of this equals a known value for this 25 KHz deviation. As such, 
steady level adjustment process will be described below in this known value corresponds to audio reference level LI. 
greater detail with respect to the embodiment of FIGS. 9 and Because demodulator 60 produces a monaural sum signal, 
10. level processor circuit 26' does not include an adder similar 

It has been noted above that the teachings of this invention 50 to adder 39 of systems 20 and 20 f . 
apply equally well to other types of signals, including video, For CTV system 20". the cue tones received by commer- 
data, control, etc. Video levels are usually easier to control cial insertion controller 46 transmit on line 23 from program 
than are audio levels because a well defined relationship video source 21V and on lines 23* from similar equipment 
exists between the desired picture information and its sync (not shown) in the other channels 100. In addition to timing 
signal level. Because the sync signal level is normally 55 information, these cue tones include channel tuning infor- 
consistent from one program segment to all others, it may be mation which insertion controller 46 uses to direct the 
easily used to set video level. However, a few cases exist in ad-insertion process of ad-insertion system 24'. In this 
which it is desirable to automatically set the level of the regard, ad-insertion system 24' is a conventional system 
video signal other than using the sync signals. One important having multiple channels for simultaneously inserting corn- 
instance occurs when different video sources are being 60 mercials in more than one subscriber channel via lines 27 
switched to a common output channel, or when video system and 27*. Additionally, insertion controller 46 uses the chan- 
gain changes over time. nel tuning information to automatically tune audio demodu- 

Therefore, if level control attenuators are placed in the lator 60 from one subscriber channel to another. When a 

output of an advertisement video source, such as in lines 27 program segment, e.g., program segment PI of FIG. 2, nears 

of FIGS. 3 and 6, they can be used to match the video level 65 its end on any channel, commercial insertion controller 46 
of a commercial to that of the program video source. A tunes audio demodulator 60 to that channel for performance 

detector would detect the level of the video by measuring of its appropriate normalization process. 
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The audio level normalization control loop for each known that the pilot level on stereo channels is not auto- 
channel in CTV system 20" includes attenuators 55-57 and maticaUy set correctly. After a stereo encoder generates such 
65-67, and stereo encoder and audio modulator 30 A. These pilot levels, they are modulated onto the audio carrier. In 
items will be unique to each channel. This normalization some equipment, the adjustment of the pilot level is a factory 
control loop also includes level processor circuit 26* which 5 calibration, and in other equipment is a field adjustment. It 
comprises demodulator 60, level detector 40, amplifier 32, is possible to add to level processor 26\ for example, a 
A/D converter 44 and logic 42. These items are shared by all channel tuned to the standard 15.734 KHz pilot frequency, 
of the channels. Level processor circuit 26' normalizes the The level of the pilot could then be measured. Due to the 
program audio from all channels, including other channels way equipment is normally built, it will not be readily 
100, and commercial audio being routed through all chan- possible to automatically correct the pilot level, but a visual 
nels from ad-insertion system 24'. As such, CTV system 20" or audible indicator could notify an operator of a problem, 
offers a closed-loop control of audio level that includes As SU gg e sted above, those skilled in the art are aware that 
normalization of mismatched levels due to several sources measuring audio level is extremely difficult, in part by virtue 
including the audio modulators. of me fact ^ norma i programming includes constantly 

Having the elements of tuner processor circuit 26" com- varying audio levels, possibly covering a very large dynamic 

mon to all channels both niiiiirnizes cost and promotes more range For example, a program may include a loud argument 

consistency from one channel to another The use of com- between ^ individuals, followed by an almost silent scene 

mon equipment does however, preclude the measurement of whefl Qne Qf mem mns QUt of me house and WQods 

the audio level just before a commercial on a plurality of mch ^ ^ me deviation 

channels. Consequently a queumg routine must be used for ^ 

defining priority channels as the insertion process proceeds. 20 & & > b & 

Hie present invention contemplates that insertion controller ™« sce ° e fan where toectQr » «** c J ud *? cl * 

46 be programmed to define and select priority channels to P laced * Adjusting to a louder signal (higher deviation) 

which audio demodulator 60 is tuned just prior to a com- during the running scene, is not appropriate. An algorithm 

mercial. As mentioned above, conventional insertion con- for gradually adjusting audio level (and for determining the 

toilers know the time that commercials will play within a 2 5 1113100 level of a commercial) is programmed into the 

minute or two in most cases. As such, some channels can be software of logic 42 and controller 46. Although various 

monitored between commercials and their audio deviation algorithms are possible, one that experience teaches may be 

optimized for average audio over a period other than the most appropriate with respect to system 20" of FIGS. 9 and 

short period just before the commercial (e.g., other than time 10 will now be described. 

period t2-t3). In practice, this solution will produce sufficient 30 During program content, loudness is periodically moni- 

normalization in most cases because audio levels do not tored over several periods and attenuators 65 and 66 are 

change particularly fast adjusted while a program segment runs. A one minute (for 

Further, when commercials begin simultaneously on sev- example) timer in logic 42 starts each time level detector 40 

eral subscriber channels (a common occurrence), insertion indicates that the audio level has reached reference level LI. 

controller 46 selects these channels sequentially according 35 If the audio level reaches reference level LI more than, for 

to pre-defined priority routines. Although insertion control- example, five times during a minute, the level is turned down 

ler 46 may assign tuning priorities for audio demodulator 60 by increasing the attenuation of attenuators 65 and 66. The 

according to any number of queuing routines, including a adjustment is done in a minimum control increment so as not 

random assignment routine, a preferred priority routine for to be audible to the listener. For example, an increment of 

making priority assignments is as follows: the channels to be 40 0.5 dB is suitable. On the other hand, if the audio does not 

given the highest priority are chosen from a random selec- reach the maximum level at all during the minute, the 

tion of the channels whose level offsets are unknown (i.e., volume is increased by adjusting attenuators 65 and 66. The 

they have never been measured and stored); followed by an volume is increased by the same minimum amount which is 

ordered selection of the remaining channels based on the not detectable by the listener. If another minute elapses 

length of time that its commercial has last played, with those 45 without the audio level reaching the reference level LI, logic 

having the longer periods given the higher priority. Using 42 adjusts attenuators 65 and 66 so that the audio level is 

this priority routine, commercials that have not yet played again increased by that minimum amount Such an algo- 

and, therefore, have not yet been normalized are given top rithm will result in the volume level ultimately being set to 

priority. Because there is more opportunity for equipment reference level LI as close as possible, 

level settings to drift over time, priorities are next assigned 50 Another algorithm would monitor the level with respect to 

based on the length of time since the commercial last played; reference level LI, and any time the audio level reaches 

the one with the longest time period of non-play assigned the reference level LI, logic 42 adjusts attenuators 65 and 66 so 

top priority. When there is a conflict in priorities, the routine that the audio level is reduced almost instantaneously until 

makes random assignments from those involved in the it drops below reference level LI. If reference level LI is not 

conflict. 55 reached again for some length of time, such as one minute, 

Quality control functions are possible for systems of the logic 42 adjusts attenuators 65 and 66 so that the audio level 

type shown in FIGS. 9 and 10. It is known by viewers of increases by the same small increment as above. The 

cable television that the audio level from one channel to increase is repeated at intervals of a minute or so, until 

another is not always consistent This is usually caused by reference level LI is again reached. This type of algorithm 

head-end set-up errors. The set-up errors are, in turn, caused 60 forms a fast attack, slow decay volume control strategy, 

by excessive work load or insufficient training of personnel, which is often preferred for volume control downstream of 

or by limitations in equipment By sequentially monitoring where the artistic qualities of the program are set. In general 

the level on all channels, the audio level of all channels can this is the type of algorithm that is favored in this circum- 

be made the same, solving a long-standing complaint against stance. 

conventional cable systems. 65 In certain circumstances the artistic qualities may be 

Another quality control function is contemplated using changed by bringing the volume up more rapidly if it does 

the standard pilot level on stereo channels. It is generally not reach reference level LI. This changes the control 
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strategy to what is often called "compression." It is generally 
not preferred unless typical listening conditions are different 
from those anticipated during creation of the program. 

Each audio transmission, be it part of a commercial or an 
entertainment program, has a characteristic audio-level sig- 5 
nature that may be derived from level detector 40. FIGS. 11 
and 12 illustrate a verification system that uses such audio- 
level signatures to automatically determine if the content of 
the transmission corresponds to the content that was 
intended. This function is of particular importance in adver- 10 
rising where it is desirable to verify that a particular com- 
mercial was transmitted at the proper time on the proper 
channel. 

Referring now to FIGS. 11 and 12, verification system 
109 includes digital sampler 110 which outputs a unique 15 
series of audio level samples over predefined time periods 
for each commercial. Digital processor 111 receives these 
samples from sampler 110 while receiving commercial ID 
data and trigger pulses from insertion controller 46 in 
monitor STEP 120 and data STEPS 121. Processor 111 also 20 
connects to memory 112, correlation detector 113 and alarm 
114. Generally, processor 111 stores the signatures for all 
commercials that are to be played in a signature table in 
memory 112, via the YES path of decision STEP 122 and 
store STEP 123. These signatures are stored, along with their 2 5 
commercial ID data, the first time that each commercial is 
played by the CTV system, or, alternatively, they may be 
imported to memory 112 from data obtained when the 
commercials are recorded. Processor 111 performs corre- 
sponding auto correlations on each of the signatures and 3 q 
stores the auto correlation results in the signature table. Still 
further, for each commercial, processor 111 finds the stan- 
dard deviation for the set of samples that make up its 
signature. These standard deviation values, which are mea- 
sures of the extent that the set of samples of each signature 35 
deviates from its mean, are also calculated and stored in the 
signature table during store STEP 123. 

Later, when a particular commercial plays, processor 111 
stores, in store STEP 125, its commercial ID data and 
signature in a verification table to be used for verification. To 40 
verify that a particular commercial that was played, say 
commercial CX, was the desired commercial, say commer- 
cial CI, processor 111 retrieves the CX signature of the 
unknown commercial from the verification table. Processor 
111 performs, in calculate STEP 126, a correlation of the 45 
unknown signature CX with the known signature CI. Pro- 
cessor 111 then compares, in decision STEP 128, the auto- 
correlation Cl/Clwith the correlation Cl/CX and the stan- 
dard deviation of commercial CI. If the autocorrelation 
Cl/Cland correlation Cl/CX are of comparable value and 50 
high as compared to the standard deviation of commercial 
CI. then there is a strong likelihood that commercial CX 
corresponds to commercial CI. However, if the autocorre- 
lation Cl/Clis high compared with the standard deviation of 
commercial CI while correlation Cl/CX is low compared 55 
with the standard deviation of commercial CI, processor 111 
fails to verify commercial CX and activates alarm 114. Of 
course, the autocorrelation results of the stored signatures 
need not be stored in signature table, but instead calculation 
STEP 126 may include the process of obtaining both the 60 
desired autocorrelation Cl/Cland the apparent cross- 
correlation Cl/CX before obtaining verification via verify 
STEP 128. 

To maximize system performance, it is contemplated 
further that level detector 40 preferably measure the sub- 65 
jective loudness of the audio. This contrasts with other 
measuring options, such as measuring peak audio levels. In 



the instant invention level detector 40 preferably obtains a 
metric of the audio level that correlates well with the way 
humans subjectively perceive the loudness of audio program 
material. 

Measuring the subjective loudness of an audio signal is a 
difficult undertaking. One of the earlier known standards for 
doing this is the volume unit (VU) meter developed coop- 
eratively by the telephone and broadcast industries in 1939, 
and still in use today. The VU meter standard was developed 
based on the capabilities of the D'Arsonval meter move- 
ments available at that time. That the standard remains in use 
today is more of a testament to the difficulty of measuring 
subjective loudness, than it is to the validity of the standard 
as a measure of loudness. Those skilled in these arts nor- 
mally use this standard only for general guidance while 
imposing their own subjective judgment as to how loud a 
signal sounds. 

The European Broadcast Union (EBU) has for some years 
used a peak reading meter to measure the loudness of a 
signal. Some authorities propose that the EBU method is 
superior to using a VU meter because it relates nicely to the 
needs of engineers monitoring the peak level of a signal for 
transmission purposes. However, researchers have sought 
better methods for measuring an audio signal that correlate 
with subjective loudness. FIG. 13 shows a schematic of a 
preferred circuit suitable for use as level detector 40 for 
measuring subjective loudness of an audio signal. 

Level detector 40 is a particularly critical circuit, in that 
it must produce a voltage that varies proportionally to the 
subjective loudness heard by a typical listener. Level detec- 
tor 40, as depicted in FIG. 13, combines a close approxi- 
mation of a standard A-weighting curve, such as shown in 
FIG. 14, with a factor reflecting the observation that fast, 
transitory sound is not perceived as loudly as would be the 
same sound if it lasted longer. These concepts are discussed 
in the following publications: Benson, Audio Engineering 
Handbook, 1988, pp 1-38 to 1-39; and Burden, et aL, "A 
Different Approach to the Old Problem of Audio Level 
Monitoring," 84th Convention of the Audio Engineering 
Society, March 1-4, Paris, 1988, pp 1 to 8. 

The A-weighting curve plots response vs. frequency such 
that its values closely resemble the typical response of the 
human ear. Filtering circuits having a response that approxi- 
mates the A-weighting curve have been used in audio level 
measurements as a means of pre-filtering a signal such that 
subsequent detection of volume level will be roughly equal 
to the frequency response exhibited by the human ear. 
Further, it is known that humans generally perceive loudness 
partially as a function of duration of the sound, with tran- 
sitory signals sounding progressively louder until the dura- 
tion exceeds about 200 micro-seconds. Beyond this period, 
the perceived loudness no longer varies as a function of 
duration. 

FIG. 13 depicts the specific circuit elements for an imple- 
mentation of level detector 40 that was tested and compared 
to an A-weighting curve as shown in FIG. 14. The circuit 
schematic of FIG. 13 places parameter values for the various 
elements adjacent the element symbols. In FIG. 13, capaci- 
tance values are given in microfarads and resistance values 
are in ohms. The reference markings adjacent diodes D1-D4 
and amplifiers U1A, U1B, U1C, U1D and U2Aare conven- 
tional identifications of these elements. Transistor Ql is a 
conventional NPN transistor. 

The audio signal monitored by level detector 40 appears 
at input terminal 140. Amplifier U1A and the associated 
elements constitute an active band-pass filter 141. Generally, 
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filter 141 is a conventional circuit with the addition of 
capacitor C3 which creates a portion of the rolloff required 
by the A-weighting curve. From filter 141, the signal passes 
to low-pass filter 142 comprising amplifier U1B and the 
associated components. Again, this circuit is known to those 
skilled in the art High-pass filter 143, comprising capacitor 
C5 and resistor R8, provides a portion of the rolloff at low 
frequencies required by the response of the A-weighting 
curve. 

The output of high-pass filter 143 passes to full wave 
rectifier 144, consisting of amplifiers U1C and U1D and 
their associated components. Again, this circuit is familiar to 
those skilled in the art. The output at pin 8 of U1C is a full 
wave rectified representation of the audio signal at input 
terminal 140 after it has passed through filters 141-143. The 
R-C circuit, formed by resistor R16 and capacitor C6, limits 
the attack time of the circuit to limit the effect on the audio 
level of a momentary high level sound. The time constant of 
the R-C circuit is much shorter than the maximum duration 
proposed by the Benson publication cited above. This is 
done to allow a certain degree of fast attack characteristic to 
pass to the measuring facility. 

Finally, dB circuit 145, comprising amplifier U2A and 
associated components, converts the output of rectifier 144 
to decibel or logarithmic representation. This circuit is also 
known to those skilled in the art Output terrninal 146 passes 
the signal to comparison amplifier 32. FIG. 14 shows the 
A-weighting curve as a solid line and the test values of the 
filter response of level detector 40 as X's. As can be seen 
from inspection, there is a close correlation between the 
responses of the A-weighting curve and the test results of the 
FIG. 13 implementation of level detector 40. 

It is to be understood, therefore, that within the scope of 
the appended claims, the invention may be practiced other- 
wise than as specifically described. 

What is claimed is: 

1. A signal processing system having a normalized output 
signal comprising: 
a first signal source; 
a second signal source; 

a signal combining means connected to said first and 
second signal sources for forming an output signal by 
linking signal segments derived from said first and 
second signal sources into a series of said signal 
segments; 

a level processor means connected to said signal combin- 
ing means for feteimining a level of intensity of said 
output signal, wherein said level processor means gen- 
erates a target level and an error level related to the 
difference between said target level and said level of 
intensity of said output signal and, wherein said level 
processor means includes level storage means for stor- 
ing said error level and the corresponding source of 
said signal segment; and 

a level adjusting means connected to at least one of said 
signal sources and responsive to said level processor 
means for adjusting a level of intensity of said signal 
segments from said at least one of said signal sources 
such that said level of intensity of said output signal is 
normalized, wherein said level adjusting means adjusts 
said level of intensity of said signal segments as a 
function of said error level, and further wherein said 
level adjusting means adjusts said level of intensity of 
said signal segments as a function of corresponding 
ones of said error level stored in said level storage 
means. 
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2. The system of claim 1 wherein said target level is a 
function of said level of intensity of said output signal for 
signal segments derived from said first signal source. 

3. The system of claim 1 wherein said level processor 
means includes a reference level means for generating said 
target level at a fixed predetermined value. 

4. A signal processing system having a normalized output 
signal comprising: 

a first signal source; 
a second signal source; 

a signal combining means connected to said first and 
second signal sources for forming an output signal by 
linking signal segments derived from said first and 
second signal sources into a series of said signal 
segments; 

a level processor means connected to said signal combin- 
ing means for determining a level of intensity of said 
output signal, wherein said level processor means fur- 
ther includes a reference level means for generating a 
predetermined fixed reference level, and said level 
processor means determines an error level related to the 
difference between said level of intensity of said output 
signal and said fixed reference level; and 

a level adjusting means connected to at least one of said 
signal sources and responsive to said level processor 
means for adjusting a level of intensity of said signal 
segments from said at least one of said signal sources 
such that said level of intensity of said output signal is 
normalized and, said level adjusting means further 
connected to said first and second sources for adjusting 
said level of intensity of said signal segments at cor- 
responding ones of said signal sources. 

5. The system of claim 4 wherein said level processor 
means determines said level of intensity of said output signal 
for signal segments derived from said first signal source, and 
said level adjusting means adjusts said level of intensity of 
said signal segments at said second signal source as a 
function of said level of intensity of a preceding signal 
segment from said first signal source. 

6. A signal processing system having a normalized output 
signal comprising: 

a first signal source, wherein said first signal source 
includes cue means for marking borders of said signal 
segments; 

a second signal source, said second signal source being 
responsive to said cue means for generating signal 
segments and for causing said signal combining means 
to link said signal segments at said borders; 

a signal combining means connected to said first and 
second signal sources for forming an output signal by 
linking signal segments derived from said first and 
second signal sources into a series of said signal 
segments; 

a level processor means connected to said signal combin- 
ing means for determining a level of intensity of said 
output signal, wherein said level processor means gen- 
erates a target level and an error level related to the 
difference between said target level and said level of 
intensity of said output signal; and 

a level adjusting means connected to at least one of said 
signal sources and responsive to said level processor 
means for adjusting a level of intensity of said signal 
segments from said at least one of said signal sources 
such that said level of intensity of said output signal is 
normalized, wherein said level adjusting means adjusts 
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said level of intensity of said signal segments as a 
function of said error level. 

7. The system of claim 6 wherein said level adjusting 
means adjusts said level of intensity of said signal segments 
at said borders of said signal segments. 

8. The system of claim 6 wherein said first and second 
signal sources include audio signals and said level of inten- 
sity corresponds to the loudness of said audio signals. 

9. A signal transmission system for producing a composite 
output signal formed by linking signals from a plurality of 
signal sources comprising: 

a first signal source means for generating a series of first 
signal segments and cue tones indicating the borders of 
said first signal segments; 

a second signal source means connected to said first signal 
source means and responsive to said cue tones for 
generating a series of second signal segments; 

a signal combining means connected to said first and 
second signal source means for forming said composite 
output signal by alternately linking said first signal 
segments with said second signal segments; 

a level processor means connected to said signal combin- 
ing means for determining a level of intensity of said 
composite output signal; and 

a level adjusting means connected to at least one of said 
signal source means and responsive to said level pro- 
cessor means for adjusting a level of intensity of signal 
segments from said at least one of said signal source 
means such that said level of intensity of said output 
signal is normalized. 

10. The system of claim 9 wherein said first signal 
segments and said second signal segments include audio 
signals and said level of intensity corresponds to the loud- 
ness of said audio signals. 

11. The system of claim 10 wherein said level processor 
means determines the loudness of a portion of said first 
signal segments, and said level adjusting means adjusts the 
loudness of said second signal segments as a function of said 
loudness of said portion of said first signal segments. 

12. The system of claim 11 wherein said level processor 
means generates a target loudness level and an error level 
related to the difference between said target loudness level 
and the loudness of said output signal, and said level 
adjusting means adjusts said loudness of said output signal 
as a function of said error level. 

13. The system of claim 12 wherein said target loudness 
level is a function of said loudness of portions of said first 
signal segments. 

14. The system of claim 12 wherein said level processor 
means includes a reference level means for generating said 
target level at a fixed predetermined value. 

15. The system of claim 12 wherein said level processor 
means includes level storage means for storing said error 
levels, and said level adjusting means adjusts the loudness of 
said signal segments as a function of corresponding ones of 
said error levels stored in said level storage means. 

16. The system of claim 12 wherein said level adjusting 
means is connected to said first and second source means for 
adjusting said loudness of said first signal segments and said 
second signal segments. 

17. The system of claim 12 wherein said first signal source 
means includes cue means for marking borders of said first 
signal segments, and said second signal source means being 
responsive to said cue means for generating said second 
signal segments and for causing said signal combining 
means to link at least one of said second signal segments to 
said first signal segments at said borders. 
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18. The system of claim 17 wherein said composite output 
signal includes a plurality of said second signal segments 
linked between successive ones of said first signal segments, 
said level processor means determines the loudness of each 
of said plurality of said second signal segments, and said 
level adjusting means adjusts said loudness for each of said 
plurality of said second signal segments. 

19. The system of claim 18 wherein said level processor 
means includes signal signature means for storing predeter- 
mined loudness signatures for said signal segments, and for 
correlating a plurality of samples of said loudness for one of 
said signal segments with a corresponding one of said 
signatures to verify the transmission of said one of said 
signal segments. 

20. The system of claim 19 wherein said level processor 
means includes loudness detector means having a frequency 
response that substantially resembles the typical response of 
a human ear. 

21. A signal transmission system for transmitting an 
output signal comprising: 

a plurality of signal channels, each said channel compris- 
ing: 

a first signal source means for generating a series of 
first signal segments and cue tones indicating the 
borders of said first signal segments; 
a second signal source means connected to said first 
signal source means and responsive to said cue tones 
for generating a series of second signal segments; 
a first signal combining means connected to said first 
and second signal source means for forming a chan- 
nel output formed by alternately linking said first 
signal segments with said second signal segments; 
a second signal combining means connected to each of 
said first signal combining means for combining said 
channel outputs for simultaneous transmission; 
a level processor means connected to said second signal 
combining means for detenmning a level of intensity of 
signals in one of said channel outputs; and 
a level adjusting means connected to at least one of said 
signal source means in each said channel and respon- 
sive to said level processor means for adjusting a level 
of intensity of said signal segments from said at least 
one of said signal source means in one of said channels 
such that said level of intensity of said channel output 
is normalized. 

22. The system of claim 21 wherein each said first signal 
combining means includes a modulator means for modulat- 
ing said channel output onto a carrier signal, and said level 
processor means includes a tunable loudness detector means 
for selectively detecting loudness for each of said channel 
outputs. 

23. The system of claim 22 wherein said level processor 
means determines said level of intensity of said signal 
segments of one of said channel outputs, and said level 
adjusting means adjusts said level of intensity of said signal 
segments for the other of said channel outputs as a function 
of said level of intensity of said one of said channel outputs. 

24. A signal processing method for producing a normal- 
ized output signal comprising: 

generating a first signal from a first signal source; 

generating a second signal from a second signal source; 

combining said first and second signals to form an output 
signal by linking signal segments from said first and 
second signals into a series of signal segments; 

determining a level of intensity of said output signal 
including determining said level of intensity of said 
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output signal for signal segments derived from said first 
signal and generating a target level and an error level 
related to the difference between said target level and 
said level of intensity of said output signal; and 
adjusting a level of intensity of said signal segments to 
produce said normalized output signal including adjust- 
ing said level of intensity of signal segments from said 
second signal as a function of said level of intensity of 
a preceding signal segment from said first signal, and 
further including adjusting said level of intensity of 
said signal segments as a function of said error level. 

25. The method of claim 24 wherein said determining step 
includes generating said target level as a function of said 
level of intensity of said output signal for signal segments 
derived from said first signal. 

26. The method of claim 24 further including storing said 
error levels and the corresponding source of said signal 
segments, and wherein said adjusting step includes adjusting 
said level of intensity of said signal segments as a function 
of said error levels stored in said storing step. 

27. A signal processing method for producing a normal- 
ized output signal comprising: 

generating a first signal from a first signal source; 

generating a second signal from a second signal source; 

combining said first and second signals to form an output 
signal by linkin g signal segments from said first and 
second signals into a series of signal segments; 

detennining a level of intensity of said output signal, 
wherein said determining step includes generating a 
predetermined fixed reference level, and deteniiining 
an error level related to the difference between said 
level of intensity of said output signal and said fixed 
reference level; and 

adjusting a level of intensity of said signal segments to 
produce said normalized output signal, said adjusting 
step including adjusting said level of intensity of said 
signal segments for corresponding ones of said first and 
second signals. 

28. A signal processing method for producing a normal- 
ized output signal comprising: 

generating a first signal from a first signal source, wherein 
said step of generating a first signal includes generating 
cue tones for marking borders of said signal segments; 

generating a second signal from a second signal source, 
wherein said step of generating a second signal 
includes generating signal segments responsive to said 
cue tones for combining said signal segments at said 
borders; 

combining said first and second signals to form an output 
signal by linking signal segments from said first and 
second signals into a series of signal segments; 
determining a level of intensity of said output signal; and 
adjusting a level of intensity of said signal segments to 
produce said normalized output signal. 

29. A signal transmission method for producing a com- 
posite output signal by linking signals from a plurality of 
signal sources comprising: 

generating a series of first signal segments and cue tones 
indicating the borders of said first signal segments; 

generating a series of second signal segments in response 
to said cue tones; 

forming said composite output signal by alternately link- 
ing said first signal segments with said second signal 
segments; 
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determining a level of intensity of said composite output 
signal; and 

adjusting a level of intensity of signal segments such that 
said level of intensity of said composite output signal is 
5 normalized. 

30. The method of claim 29 wherein said steps of gener- 
ating a series of first signal segments and generating a series 
of second signal segments includes generating audio signals, 

1Q and said level of intensity corresponds to the loudness of 
said audio signals. 

31. The method of claim 29 wherein said oetermining step 
includes detexmining the loudness for a portion of said first 
signal segments, and said adjusting step includes adjusting 

15 the loudness of said second signal segments as a function of 
said loudness of said portion of said first signal segments. 

32. The method of claim 30 wherein said determining step 
includes generating a target loudness level and an error level 
related to the difference between said target loudness level 

20 and the loudness of said composite output signal, and said 
adjusting step includes adjusting said loudness of said output 
signal as a function of said error level. 

33. The method of claim 32 further including storing said 
error levels, and wherein said adjusting step includes adjust- 

25 ing the loudness of said signal segments as a function of said 
error levels stored in said storing step. 

34. The method of claim 29 wherein said adjusting step 
includes adjusting said loudness of said first signal and said 

3Q second signal. 

35. The method of claim 30 wherein said determining step 
includes storing predetennined loudness signatures for said 
signal segments, and correlating one a plurality of samples 
of said loudness for one of said signal segments with a 

35 corresponding one of said loudness signatures to verify the 
transmission of said one of said signal segments. 

36. The system of claim 35 wherein said detemiining step 
includes detecting said loudness as a function of the fre- 
quency response of the typical response of a human ear. 

40 37. A signal transmission method for transmitting an 
output signal comprising: 

generating composite output signals in each of a plurality 
of signal channels comprising: 
generating a series of first signal segments and cues 
45 indicating the borders of said first signal segments; 

generating a series of second signal segments in 

response to said cues; and 
forming a channel output by alternately linking said 
first signal segments with said second signal seg- 
50 ments at said borders in response to said cues; 

combining said channel outputs for simultaneous trans- 
mission; 

determining a level of intensity of signals in one of said 
55 channel outputs; and 

adjusting a level of intensity of each of said channel 
outputs as a function of said one of said channel outputs 
such that said levels of intensity of said channel outputs 
are normalized. 

60 38. The system of claim 37 wherein said step of fonning 
a channel output includes modulating said signal segments 
onto a carrier signal, and said detenriining step includes 
detecting and demodulating a preselected one of said chan- 
nel outputs for selectively determining loudness for each of 

6 5 said channel outputs. 

***** 



